Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery

نویسندگان

  • Man-Hung Siu
  • Herbert Gish
  • Arthur Chan
  • William Belfield
  • Steve Lowe
چکیده

We present our approach to unsupervised training of speech recognizers. Our approach iteratively adjusts sound units that are ptimized for the acoustic domain of interest. We thus enable the use of speech recognizers for applications in speech domains here transcriptions do not exist. The resulting recognizer is a state-of-the-art recognizer on the optimized units. Specifically we ropose building HMM-based speech recognizers without transcribed data by formulating the HMM training as an optimization ver both the parameter and transcription sequence space. Audio is then transcribed into these self-organizing units (SOUs). We escribe how SOU training can be easily implemented using existing HMM recognition tools. We tested the effectiveness of SOUs n the task of topic classification on the Switchboard and Fisher corpora. On the Switchboard corpus, the unsupervised HMM-based OU recognizer, initialized with a segmental tokenizer, performed competitively with an HMM-based phoneme recognizer trained ith 1 h of transcribed data, and outperformed the Brno University of Technology (BUT) Hungarian phoneme recognizer (Schwartz t al., 2004). We also report improvements, including the use of context dependent acoustic models and lattice-based features, hat together reduce the topic verification equal error rate from 12% to 7%. In addition to discussing the effectiveness of the SOU pproach, we describe how we analyzed some selected SOU n-grams and found that they were highly correlated with keywords, emonstrating the ability of the SOU technology to discover topic relevant keywords. 2013 Published by Elsevier Ltd.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision

In our previous publication [1], we presented a new approach to HMM training, viz., training without supervision. We used an HMM trained without supervision for transcribing audio into self-organized units (SOUs) for the purpose of topic classification. In this paper we report improvements made to the system, including the use of context dependent acoustic models and lattice based features that...

متن کامل

Unsupervised training of an HMM-based speech recognizer for topic classification

HMM-based Speech-To-Text (STT) systems are widely deployed not only for dictation tasks but also as the first processing stage of many automatic speech applications such as spoken topic classification. However, the necessity of transcribed data for training the HMMs precludes its use in domains where transcribed speech is difficult to come by because of the specific domain, channel or language....

متن کامل

Unsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units

In our previous work [1, 2], we trained an HMM-based speech recognizer without transcription or any knowledge or resources. The trained HMM recognizer was used to transcribe audio into self-organized units (SOUs) and we evaluated its performance on the task of topic identification. In this paper, we report our work in applying SOUs to discover audio patterns in spoken documents without supervis...

متن کامل

Feature Selection with Kohonen Self Organizing Classification Algorithm

In this paper a one-dimension Self Organizing Map algorithm (SOM) to perform feature selection is presented. The algorithm is based on a first classification of the input dataset on a similarity space. From this classification for each class a set of positive and negative features is computed. This set of features is selected as result of the procedure. The procedure is evaluated on an in-house...

متن کامل

Self-organizing maps and its applications in sleep apnea research and molecular genetics

This paper presents the application of special unsupervised neural networks (self-organizing maps) to different domains, as sleep apnea discovery, protein sequences analysis and tumor classification. An enhancement of the original algorithm, as well as the introduction of several hierachical levels enables the discovery of complex structures as present in this type of applications. Furthermore,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Speech & Language

دوره 28  شماره 

صفحات  -

تاریخ انتشار 2014